Multivariate methods for identifying differentially expressed genes
نویسندگان
چکیده
Motivation: Univariate testing procedures remain the most common way to identify differentially expressed genes (DEGs). Univariate techniques suffer from the multiple comparison problem and reduced power, because they fail to account for gene interaction. Motivated by these issues, we adopt a multivariate procedure. Namely, we utilize the sup-norm test, which was specifically developed for high dimensional, low sample size problems. We propose an algorithm which repeatedly applies the sup-norm test to screen for DEGs. Results: We evaluate our methodology with both simulated and experimental data. Our simulation studies establish the validity of the sup-norm statistic in terms of Type I error and power. With simulated data sets, the screening algorithm retains the majority of DEGs under a variety of experimental conditions. We also used our methodology to analyze the publicly available ApoAI knockout data set. Our algorithm identified the biologically significant genes, as discussed by other authors (Callow et al., 2000; Smyth, 2004). Availability: A set of Matlab functions used to implement the proposed methodology is available at Bioinformatics online. Contact: [email protected]
منابع مشابه
Identification of differentially expressed gene categories in microarray studies using nonparametric multivariate analysis
MOTIVATION The field of microarray data analysis is shifting emphasis from methods for identifying differentially expressed genes to methods for identifying differentially expressed gene categories. The latter approaches utilize a priori information about genes to group genes into categories and enhance the interpretation of experiments aimed at identifying expression differences across treatme...
متن کاملThe Application of a Non-Radioactive DD-AFLP Method for Profiling of Aeluropus lagopoides Differentially Expressed Transcripts under Salinity or Drought Conditions
Aeluropus lagopoides is a salt and drought tolerant grass from Poaceae family, distributed widely in arid regions. There is almost no information about the genetics or genome of this close relative of wheat that stands harsh conditions of deserts. Differential Display Amplified fragment length polymorphism (DD-AFLP) led to the improvement of a non-radioactive method for which many parameters we...
متن کاملAn investigation of two multivariate permutation methods for controlling the false discovery proportion.
Identifying genes that are differentially expressed between classes of samples is an important objective of many microarray experiments. Because of the thousands of genes typically considered, there is a tension between identifying as many of the truly differentially expressed genes as possible, but not too many genes that are not really differentially expressed (false discoveries). Controlling...
متن کاملIdentification of key genes and pathways involved in vitiligo vulgaris by gene network analysis
Background and Aim: Vitiligo vulgaris is an acquired, chronic skin and hair condition characterized clinically by loss of melanin, which, if untreated, is typically progressive and irreversible. The aim of the present study was to identify potential genes involved in the pathogenesis of vitiligo. Methods: One dataset of mRNA expression in patients with vitiligo (GSE65127) were obtained from ...
متن کاملComparison of methods for identifying differentially expressed genes across multiple conditions from microarray data
Identification of genes differentially expressed across multiple conditions has become an important statistical problem in analyzing large-scale microarray data. Many statistical methods have been developed to address the challenging problem. Therefore, an extensive comparison among these statistical methods is extremely important for experimental scientists to choose a valid method for their d...
متن کاملIdentification and Functional Prediction of Long Non-Coding RNAs Responsive to Drought stress in Lens culinaris L.
Drought stress is one of the main environmental factors that affects growth and productivity of crop plants, including lentil. In the course of evolution evolution, crucial genetic regulations mediated by non-coding RNAs (ncRNAs) have emerged in plant in response to drought and other abiotic stresses. In the present study, after identifying lncRNAs within the expression profile of lentil, RNA-s...
متن کامل